Approximate Graph Coloring by Semidefinite Programming 
David Karger * Rajeev Motwani + Madhu Sudan * 



Abstract 

We consider the problem of coloring fc-colorable graphs with the fewest possible colors. We 
present a randomized polynomial time algorithm that colors a 3-colorable graph on n vertices 
with min{0(A 1 / 3 log 1 / 2 Alogn), 0(n 1 ^ 4 log 1 / 2 n)} colors where A is the maximum degree of 
any vertex. Besides giving the best known approximation ratio in terms of n, this marks the 
first non-trivial approximation result as a function of the maximum degree A. This result can 
be generalized to fc-colorable graphs to obtain a coloring using min{0(A 1_2 / A: log 1//2 A log n), 
0(n 1-3 /( fc+1 ) log 1 / 2 n)} colors. Our results are inspired by the recent work of Goemans and 
Williamson who used an algorithm for semidefinite optimization problems, which generalize lin- 
ear programs, to obtain improved approximations for the MAX CUT and MAX 2-SAT problems. 
An intriguing outcome of our work is a duality relationship established between the value of 
the optimum solution to our semidefinite program and the Lovasz ^-function. We show lower 
bounds on the gap between the optimum solution of our semidefinite program and the actual 
chromatic number; by duality this also demonstrates interesting new facts about the i9-function. 
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1 Introduction 



A legal vertex coloring of a graph G(V, E) is an assignment of colors to its vertices such that no 
two adjacent vertices receive the same color. Equivalently, a legal coloring of G by k colors is a 
partition of its vertices into k independent sets. The minimum number of colors needed for such 
a coloring is called the chromatic number of G, and is usually denoted by x{G)- Determining the 
chromatic number of a graph is known to be NP-hard (cf. |20(| ), 

Besides its theoretical significance as a canonical NP-hard problem, graph coloring arises natu- 
rally in a variety of applications such as register allocation |ll], [12|, |DJ and timetable/examination 
scheduling ||, 43|. In many applications that can be formulated as graph coloring problems, it suf- 
fices to find an approximately optimum graph coloring — a coloring of the graph with a small though 
non-optimum number of colors. This along with the apparent impossibility of an exact solution has 
led to some interest in the problem of approximate graph coloring. The analysis of approximation 
algorithms for graph coloring started with the work of Johnson [27] who shows that a version of 
the greedy algorithm gives an 0(n/ log n)-approximation algorithm for A;-coloring. Wigderson fl42|| 
improved this bound by giving an elegant algorithm that uses 0(n 1_1 /^ -1 ^) colors to legally color a 
fe-colorable graph. Subsequently, other polynomial time algorithms were provided by Blum Q that 
use 0(tj 3 / 8 log 8 / 5 n) colors to legally color an n-vertex 3-colorable graph. This result generalizes 
to coloring a /c-colorable graph with 0(n 1_1 /( fc ~ 4 / 3 ) log 8//5 n) colors. The best known performance 
guarantee for general graphs is due to Halldorsson |25| who provided a polynomial time algorithm 
using a number of colors that is within a factor of 0(n(loglogn) 2 /log 3 n) of the optimum. 

Recent results in the hardness of approximations indicate that it may be not possible to sub- 
stantially improve the results described above. Lund and Yannakakis [34] used the results of Arora, 
Lund, Motwani, Sudan, and Szegedy B and Feige, Goldwasser, Lovasz, Safra, and Szegedy [17] 



to show that there exists a (small) constant e > such that no polynomial time algorithm can 
approximate the chromatic number of a graph to within a ratio of rf unless P = NP. The current 
hardness result for the approximation of the chromatic number is due to Feige and Kilian [18] and 
Hastad |26j] , who show that approximating it to within ra 1 ^, for any 5 > 0, would imply NP=RP 
(RP is the class of probabilistic polynomial time algorithms making one-sided error). However, 
none of these hardness results apply to the special case of the problem where the input graph is 
guaranteed to be A;-colorable for some small k. The best hardness result in this direction is due to 
Khanna, Linial, and Safra |2^[ who show that it is not possible to color a 3-colorable graph with 4 
colors in polynomial time unless P = NP. 

In this paper we present improvements on the result of Blum. In particular, we provide a 
randomized polynomial time algorithm that colors a 3-colorable graph of maximum degree A 
with min {©(A 1 / 3 log 1/2 A log n), Oin 1 / 4 log 1/2 n)} colors; moreover, this can be generalized to k- 
colorable graphs to obtain a coloring using 0(A 1_2//fc log 1 / 2 A log n) or 0(n 1_3 /( fc+1 ) log 1 / 2 n) colors. 
Besides giving the best known approximations in terms of n, our results are the first non-trivial 
approximations given in terms of A. Our results are based on the recent work of Goemans and 
Williamson [21] who used an algorithm for semidefinite optimization problems (cf. p3| , Q]) to obtain 
improved approximations for the MAX CUT and MAX 2-SAT problems. We follow their basic 
paradigm of using algorithms for semidefinite programming to obtain an optimum solution to a 
relaxed version of the problem, and a randomized strategy for "rounding" this solution to a feasible 
but approximate solution to the original problem. Motwani and Naor [37| have shown that the 
approximate graph coloring problem is closely related to the problem of finding a CUT COVER 
of the edges of a graph. Our results can be viewed as generalizing the MAX CUT approximation 
algorithm of Goemans and Williamson to the problem of finding an approximate CUT COVER. In 
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fact, our techniques also lead to improved approximations for the MAX k-CVT problem fpTSf] . We 
also establish a duality relationship between the value of the optimum solution to our semidefinite 
program and the Lovasz i9-function 24, 33]. We show lower bounds on the gap between the 
optimum solution of our semidefinite program and the actual chromatic number; by duality this 
also demonstrates interesting new facts about the -^-function. 

Alon and Kahale [Q] use related techniques to devise a polynomial time algorithm for 3-coloring 
random graphs drawn from a "hard" distribution on the space of all 3-colorable graphs. Recently, 
Frieze and Jerrum [19] have used a semidefinite programming formulation and randomized rounding 
strategy essentially the same as ours to obtain improved approximations for the MAX fc-CUT 
problem with large values of k. Their results required a more sophisticated version of our analysis, 
but for the coloring problem our results are tight up to poly-logarithmic factors and their analysis 
does not help to improve our bounds. 

Semidefinite programming relaxations are an extension of the linear programming relaxation 
approach to approximately solving NP-complete problems. We thus present our work in the style 
of the classical LP-relaxation approach. We begin in Section ^ by defining a relaxed version of 
the coloring problem. Since we use a more complex relaxation than standard linear programming, 
we must show that the relaxed problem can be solved; this is done in Section ^. We then show 
relationships between the relaxation and the original problem. In Section ||, we show that (in a 
sense to be defined later) the value of the relaxation bounds the value of the original problem. 
Then, in Sections [5|, |(| and 0, we show how a solution to the relaxation can be "rounded" to make 
it a solution to the original problem. Combining the last two arguments shows that we can find 
a good approximation. Section || Section and Sections are in fact independent and can be 
read in any order after the definitions in Section ||. In Section || we investigate the relationship 
between our fractional relaxations and the Lovasz i9-function, showing that they are in fact dual to 
one another. We investigate the approximation error inherent in our formulation of the chromatic 
number via semi-definite programming in Section 0. 



2 A Vector Relaxation of Coloring 

In this section, we describe the relaxed coloring problem whose solution is in turn used to approx- 
imate the solution to the coloring problem. Instead of assigning colors to the vertices of a graph, 
we consider assigning (n-dimensional) unit vectors to the vertices. To capture the property of a 
coloring, we aim for the vectors of adjacent vertices to be "different" in a natural way. The vector 
k-coloring that we define plays the role that a hypothetical "fractional ^-coloring" would play in a 
classical linear-programming relaxation approach to the problem. Our relaxation is related to the 
concept of an orthonormal representation of a graph [p3| , [23| . 

Definition 2.1 Given a graph G = (V,E) on n vertices, and a real number k > 1, a vector de- 
coloring of G is an assignment of unit vectors Vi from the space W 1 to each vertex i E V , such that 
for any two adjacent vertices i and j the dot product of their vectors satisfies the inequality 

1 

(v h v 3 ) < j-- 

The definition of an orthonormal representation |3^, ^3| requires that the given dot products 
be equal to zero, a weaker requirement than the one above. 
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3 Solving the Vector Coloring Problem 



In this section we show how the vector coloring relaxation can be solved using semidefinite pro- 
gramming. The methods in this section closely mimic those of Goemans and Williamson pjj . 
To solve the problem, we need the following auxiliary definition. 

Definition 3.1 Given a graph G = (V,E) on n vertices, a matrix £>coloring of the graph is an 
n x n symmetric positive semidefinite matrix M, with ma = 1 and rriij < —l/{k — 1) if {i,j} £ E. 



We now observe that matrix and vector fe-colorings are in fact equivalent (cf. |21[| ). Thus, to 
solve the vector coloring relaxation it will suffice to find a matrix A;-coloring. 

Fact 3.1 A graph has a vector k-coloring if and only if it has matrix k-coloring. Moreover, a vector 
(k + e)-coloring can be constructed from a matrix k-coloring in time polynomial in n and log(l/e). 

Note that an exact solution cannot be found, as some of the values in it may be irrational. 

Proof: Given a vector A;-coloring {vi}, the matrix A;-coloring is defined by rriij = (vi,Vj). For the 
other direction, it is well known that for every symmetric positive definite matrix M there exists a 
square matrix U such that UU T = M (where U T is the transpose of U). The rows of U are vectors 
{uj}™ =1 that form a vector /c-coloring of G. 

An (5-close approximation to the matrix U can be found in time polynomial in n and log(l/<5) 
using the Incomplete Cholesky Decomposition [21, ^2|. (Here by (5-close we mean a matrix U' such 
that U'U' T — M has L ro norm less than 5.) This in turn gives a vector (k + e)-coloring of the graph, 
provided 5 is chosen appropriately. ■ 

Lemma 3.2 // a graph G has a vector k-coloring then a vector (k + e)-coloring of the graph can 
be constructed in time polynomial in k, n, and log(l/e). 

Proof: Our proof is similar to those of Lovasz |33| and Goemans- Williamson [21]. We construct 
a semidefinite optimization problem (SDP) whose optimum is —l/(k — 1) when k is the smallest 
real number such that a matrix A>coloring of G exists. The optimum solution also provides a matrix 
/c-coloring of G. 

minimize a 

where { m ij} is positive semidefinite 
subject to rriij < a if G E 

TTlij — q m,ji 

ma = 1. 

Consider a graph which has a vector (and matrix) fc-coloring. This means there is a solution to the 
above semidefinite program with a = — l/(k— 1). The ellipsoid method or other interior point based 
methods H, II 

can be employed to find a feasible solution where the value of the objective is at 
most —l/{k — 1) + 5 in time polynomial in n and log 1/5. This implies that for all {i,j} € E, rriij is 
at most 5 — l/(k — 1), which is at most —l/(k + e — 1) for e = 25(k — l) 2 , provided 5 < l/2(k — 1). 
Thus a matrix {k + e)-coloring can be found in time polynomial in k, n and log(l/e). From the 
matrix coloring, the vector coloring can be found in polynomial time as was noted in the previous 
lemma I 

For the remainder of the paper, we will ignore the e error term of Lemma since it can be 
made so small as to be irrelvant to our analysis. 



3 



4 Relating the Original and Relaxed Solutions 



In this section, we show that our vector coloring problem is a useful relaxation because the solution 
to it is related to the solution of the original problem. In order to understand the quality of the 
relaxed solution, we need the following geometric lemma: 

Lemma 4.1 For all positive integers k and n such that k < n + 1, there exist k unit vectors in ffl 1 
such that the dot product of any distinct pair is —l/{k — 1). 

Proof: Clearly it suffices to prove the lemma for n = k — 1. (For other values of n, we make the 
coordinates of the vectors in all but the first k — 1 coordinates.) We begin by proving the claim for 
n = k. We explicitly provide unit vectors v[ k \ . . . 6 ^ k ~ l such that ^ k \vj k ) < —l/(k — 1) 
for i 7^ j. The vector is ~^\J k(k- T) m a ^ coordinates except the ith coordinate. In the ith 

coordinate is \J ^r-. It is easy to verify that the vectors are unit length and that their dot 
products are exactly jp^. 

As given, the vectors are in a fc-dimensional space. Note, however, that the dot product of 
each vector with the all-l's vector is 0. This shows that all k of the vectors are actually in a 
(k-l)-dimensional hyperplane of the k-dimensional space. This proves the lemma. I 



Corollary 4.2 Every k-colorable graph G has a vector k-coloring. 

Proof: Bijectively map the k colors to the k vectors defined in the previous lemma. 



Note that a graph is vector 2-colorable if and only if it is 2-colorable. Lemma 4.1 is tight in 
that it provides the best possible value for minimizing the maximum dot-product among k unit 
vectors. This can be seen from the following lemma. 

Lemma 4.3 Let G be vector k-colorable and let i be a vertex in G. The induced subgraph on the 
neighbors of i is vector (k — l)-colorable. 

Proof: Let v\, . . . ,v n be a vector /c-coloring of G and assume without loss of generality that 
Vi = (1, 0, 0, . . . , 0). Associate with each neighbor j of i a vector v'j obtained by projecting vj onto 
coordinates 2 through n and then scaling it up so that v'a has unit length. It suffices to show that 
for any two adjacent vertices j and j' in the neighborhood of i, ^,i/ 7 ) < — l/(k — 2). 

Observe first that the projection of Vj onto the first coordinate is negative and has magnitude 
at least l/(k — 1). This implies that the scaling factor for v'j is at least — ^===. Thus, 



A simple induction using the above lemma shows that any graph containing a (k + l)-clique is 
not /c-vector colorable. Thus the "vector chromatic number" lies between between the clique and 



chromatic number. This also shows that the analysis of Lemma |4.l| is tight in that — l/(k — 1) is 
the minimum possible value of the maximum of the dot-products of k vectors. 

In the next few sections we prove the harder part, namely, if a graph has a vector ^-coloring 
then it has an 0(A 1_2//fc ) and an 0(n 1_3 /( fc+1 ^)-coloring. 
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5 Semicolorings 



Given the solution to the relaxed problem, our next step is to show how to "round" the solution 
to the relaxed problem in order to get a solution to the original problem. Both of the rounding 
techniques we present in the following sections produce the coloring by working through an almost 
legal semicoloring of the graph, as defined below. 

Definition 5.1 A /c-semicoloring of a graph G is an assignment of k colors to at least half its 
vertices such that no two adjacent vertices are assigned the same color. 

An algorithm for semicoloring leads naturally to a coloring algorithm as shown by the following 
lemma. The algorithm uses up at most a logarithmic factor more colors than the semicoloring 
algorithm. Furthermore, we do not even lose this logarithmic factor if the semicoloring algorithm 
uses a polynomial number of colors (which is what we will show we use) . 

Lemma 5.1 // an algorithm A can ki-semicolor any i-vertex subgraph of graph G in randomized 
polynomial time, where ki increases with i, then A can be used to 0{k n \ogn)-color G. Furthermore, 
if there exists e > such that for all i, ki = £l(i e ), then A can be used to color G with 0(k n ) colors. 

Proof: We show how to construct a coloring algorithm A' to color any subgraph H of G. A' 
starts by using A to semicolor H. Let S be the subset of vertices that have not been assigned 
a color by A. Observe that IS"! < \V(H)\/2. A' fixes the colors of vertices not in S, and then 
recursively colors the induced subgraph on S using a new set of colors. 

Let Q be the maximum number of colors used by A' to color any i-vertex subgraph. Then Cj 
satisfies the recurrence 

C-i < Cj/2 + ki 

It is easy to see that this any q satisfying this recurrence, must satisfy c% < fcjlogi. In particular 
this implies that c n < 0(k n logn). Furthermore for the case where ki = £l(i e ) the above recurrence 
is satisfied only when q = 0(/cj). ■ 

Using the above lemma, we devote the next two sections to algorithms for transforming vector 
colorings into semicolorings. 



6 Rounding via Hyperplane Partitions 

We now focus our attention on vector 3-colorable graphs, leaving the extension to general k for later. 
Let A be the maximum degree in a graph G. In this section, we outline a randomized rounding 
scheme for transforming a vector 3-coloring of G into an 0(A logs 2 )-semicoloring, and thus into an 
0(A log 3 2 log n)-coloring of G. Combining this method with a technique of Wigderson [42| yields 
an O(n ' 386 )-coloring of G. The method is based on that of [^] and is weaker than the method we 
describe in the following section; however, it introduces several of the ideas we will use in the more 
powerful algorithm. 

Assume we are given a vector 3-coloring {vi}f =1 . Recall that the unit vectors Vi and Vj associated 
with an adjacent pair of vertices i and j have a dot product of at most —1/2, implying that the 
angle between the two vectors is at least 27r/3 radians (120 degrees). 

Definition 6.1 Consider a hyperplane H . We say that H separates two vectors if they do not lie 
on the same side of the hyperplane. For any edge {i,j} G E, we say that the hyperplane H cuts 
the edge if it separates the vectors Vi and Vj . 
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In the sequel, we use the term random hyperplane to denote the unique hyperplane containing 
the origin and having as its normal a random unit vector v uniformly distributed on the unit sphere 
S n . The following lemma is a restatement of Lemma 1.2 of Goemans- Williamson [21]. 



Lemma 6.1 (Goemans- Williamson [21]) Given two vectors at an angle of 9, the probability 



that they are separated by a random hyperplane is exactly 9 /it. 

We conclude that given a vector 3-coloring, for any edge {i,j} £ E, the probability that a 
random hyperplane cuts the edge is exactly 2/3. It follows that the expected fraction of the edges in 
G that are cut by a random hyperplane is exactly 2/3. Suppose that we pick r random hyperplanes 
independently. Then, the probability that an edge is not cut by one of these hyperplanes is (l/3) r , 
and the expected fraction of the edges not cut is also (l/3) r . 

We claim that this gives us a good semicoloring algorithm for the graph G. Notice that r 
hyperplanes can partition $t n into at most 2 r distinct regions. (For r < n this is tight since r 
hyperplanes can create exactly 2 r regions.) An edge is cut by one of these r hyperplanes if and 
only if the vectors associated with its end-points lie in distinct regions. Thus, we can associate a 
distinct color with each of the 2 r regions and give each vertex the color of the region containing its 
vector. The expected number of edges whose end-points have the same color is (l/3) r m, where m 
is the number of edges in E. 

Theorem 6.2 // a graph has a vector 3-coloring, then it has an 0(A log 3 2 )- semicoloring that can 
be constructed from the vector 3-coloring in polynomial time with high probability. 

Proof: We use the random hyperplane method just described. Fix r = 2 + |~log 3 A] , and note 
that (l/3) r < 1/9A and that 2 r = 0(A log 3 2 ). As noted above, r hyperplanes chosen independently 
at random will cut an edge with probability 1 — 1/9A. Thus the expected number of edges that 
are not cut is m/9A < n/18 < n/8, since the number of edges is at most nA/2. By Markov's 
inequality (cf. [pH , page 46), the probability that the number of uncut edges is more than twice 
the expected value is at most 1/2. Thus, with probability at least 1/2 we get a coloring with at 
most n/4 uncut edges. Delete one endpoint of each such edge leaves a set of 3n/4 colored vertices 
with no uncut edges — ie, a semicoloring. 

Repeating the entire process t times means that we will find a 0(A logs 2 )-semicoloring with 
probability at least 1 — 1 /2* . ■ 



Noting that log 3 2 < 0.631 and that A < n, this theorem and Lemma 5.1 implies a semicoloring 
using 0(n 0,631 ) colors. 

By varying the number of hyperplanes, we can arrange for a tradeoff between the number of 
colors used and the number of edges that violate the resulting coloring. This may be useful in some 
applications where a nearly legal coloring is good enough. 

6.1 Wigderson's Algorithm 

Our coloring can be improved using the following idea due to Wigderson [42|. Fix a threshold value 
5. If there exists a vertex of degree greater than 5, pick any one such vertex and 2-color its neighbors 
(its neighborhood is vector 2-colorable and hence 2-colorable). The colored vertices are removed 
and their colors are not used again. Repeating this as often as possible (or until half the vertices 
are colored) brings the maximum degree below 5 at the cost of using at most 2n/8 colors. At this 
point, we can semilcolor the remainder with 0(<5 0,631 ) colors. Thus, we can obtain a semicoloring 
using 0(n/5 + 8°' 6S1 ) colors. The optimum choice of 5 is around n 0,613 , which implies a semicoloring 
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using O(n ' 387 ) colors. This semicoloring can be used to legally color G using 0(n°' 387 ) colors by 
applying Lemma [5~ll . 

Corollary 6.3 A 3-colorable graph with n vertices can be colored using 0(n°' 387 ) colors by a poly- 
nomial time randomized algorithm. 

The bound just described is (marginally) weaker than the guarantee of a 0(n 0,375 ) coloring due 
to Blum |J. We now improve this result by constructing a semicoloring with fewer colors. 



7 Rounding via Vector Projections 



In this section we start by proving the following more powerful version of Theorem p.2| . A simple 



application of Wigderson's technique to this algorithm yields our final coloring algorithm. 

Theorem 7.1 For every integer function k = k(n), a vector k-colorable graph with maximum 
degree A can be semi-colored with at most 0(A 1 ~ 2 / k \/ln A) colors in probabilistic polynomial time. 

As in the previous section, this has immediate consequences for approximate coloring. 

To prove Theorem [O], given a vector /c-coloring, we show that it is possible to extract an 
independent set of size n(n/(A 1 - 2 / k y/]nA)). If we assign one color to this set and recurse on the 
rest of the vertices, we will end up using 0(A 1_2 /*v1nA) colors in all to assign colors to half 
the vertices and the result follows. To find such a large independent set, we give a randomized 
procedure for selecting an induced subgraph with n' vertices and m' edges such that E\n' — m'\ = 
Q(n/(A 1 ^ 2 / fe \/ln A)). It follows that with a polynomial number of repeated trials, we have a high 
probability of choosing a subgraph with n' —m! = Q(n/ (A 1 ~ 2 / fc v / m~A)). Given such a graph, we can 
delete one endpoint of each edge, leaving an independent set of size n' — m! = 0(n/(A 1 ~ 2 / fc v / m~A)), 
as desired. 

We now give the details of the construction. Suppose we have a vector /c-coloring assigning unit 
vectors V{ to the vertices. We fix a parameter c = c^a to be specified later. We choose a random 
n-dimensional vector r according to a distribution to be specified soon. The subgraph consists of 
all vertices i with Vi-r > c. Intuitively, since endpoints of an edge have vectors pointing away from 
each other, if the vector associated with a vertex has a large dot product with r, then the vector 
corresponding to an adjacent vertex will not have such a large dot product with r and hence will 
not be selected. Thus, only a few edges are likely to be in the induced subgraph on the selected set 
of vertices. 

To complete the specification of this algorithm and to analyze it, we need some basic facts about 
some probability distributions in 3? n . 



7.1 Probability Distributions in 9ft n 

Recall that the standard normal distribution has the density function (f>(x) = -j=e~~ x ' 2 / 2 with 



distribution function $>(x), mean 0, and variance 1. A random vector r = (n, . . . ,r n ) is said to 
have the n-dimensional standard normal distribution if the components rj are independent random 
variables, each component having the standard normal distribution. It is easy to verify that this 
distribution is spherically symmetric, in that the direction specified by the vector r is uniformly 



distributed. (Refer to Feller [14, v. II], Knuth v. 2], and Renyi |39| for further details about 
the higher dimensional normal distribution.) 
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Subsequently, the phrase "random ci-dimensional vector" will always denote a vector chosen 
from the <i-dimensional standard normal distribution. A crucial property of the normal distribution 
which motivates its use in our algorithm is the following theorem paraphrased from Renyi (3{|] (see 
also Section III.4 of Feller pi v. II]). 



Theorem 7.2 (Theorem IV. 16.3 ||39[| ) Let r = (ri, . . . ,r n ) be a random n- dimensional vector. 
The projections of r onto two lines l\ and £2 are independent (and normally distributed) if and 
only if l\ and £2 ar ^ orthogonal. 

Alternatively, we can say that under any rotation of the coordinate axes, the projections of r 
along these axes are independent standard normal variables. In fact, it is known that the only 
distribution with this strong spherical symmetry property is the n-dimensional standard normal 
distribution. The latter fact is precisely the reason behind this choice of distribution^ in our 
algorithm. In particular, we will make use of the following corollary to the preceding theorem. 

Corollary 7.3 Let u be any unit vector in ffl 1 . Let r = (ri,...,r n ) be a random vector (of 
i.i.d. standard normal variables). The projection of r along u, given by dot product (u, r), is dis- 
tributed according to the standard (1- dimensional) normal distribution. 

It turns out that even if r is a random n-dimensional unit vector, the above corollary still holds 
in the limit: as n grows, the projections of r on orthogonal lines approach (scaled) independent 
normal distributions. Thus using a random unit vectors for our projection turns out to be equivalent 
to using random normal vectors in the limit, but is messier to analyze. 

Let N(x) denote the tail of the standard normal distribution. I.e., 



N{x) = / <f>(y) dy. 

J X 

We will need the following well-known bounds on the tail of the standard normal distribution. (See, 



for instance, Lemma VII. 2 of Feller [14, v. I].) 
Lemma 7.4 For every x > 0, 

T 1\ „ 1 



<K X ) ( o ] < N(x) < 0(x) • 

\X X 6 J 



X 



Proof: The proof is immediate from inspection of the following equations relating the three 
quantities in the desired inequality to integrals involving (j)(x), and the fact <j)(x)/x is finite for 
every x > 0. 



N(x) = / 4>(y)dy, 

J X 

4>{x)-- x = J%(y)(l + ^j dy. 



1 Readers familiar with physics will see the connection to Maxwell's law on the distribution of velocities of molecules 
in 5R 3 . Maxwell started with the assumption that in every Cartesian coordinate system in 9i 3 , the three components 
of the velocity vector are mutually independent and had expectation zero. Applying this assumption to rotations of 
the axes, we conclude that the velocity components must be independent normal variables with identical variance. 
This immediately implies Maxwell's distribution on the velocities. 
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7.2 The Analysis 



We are now ready to complete the specification of the coloring algorithm. Recall that our goal is to 
repeatedly strip away large independent sets from the graph. We actually set an easier intermediate 
goal: find an induced subgraph with a large number n' of vertices and a number ml <C n' of edges. 
Given such a graph, we can delete one endpoint of each edge to leave an independent set on n' — ml 
vertices that can be colored and removed. 

As discussed above, to find this sparse graph, we choose a random vector r and take all vertices 
whose dot product with r exceeds a certain value c. Let the induced subgraph on these vertices 
have n' vertices and ml edges. We show that for sufficiently large c, n' S> ml and we get an 
independent set of size roughly n . Intuitively, this is true for the following reason. Any particular 
vertex has some particular probability p = p(c) of being near r and thus being "captured" into our 
set. However, if two vertices are adjacent, the probability that they both land near r is quite small 
because the vector coloring has placed them far apart. 

For example, in the case of 3-coloring, when the probability that a vertex is captured is p, the 
probability that both endpoints of an edge are captured is roughly p 4 (this is counter the intuition 
that the probability should go as p 2 , and follows from the fact that we force adjacent vertices to 
be far apart — see below). It follows that we end up capturing (in expectation) a set of pn vertices 
that contains (in expectation) only p 4 m < p 4 An edges in a degree-A graph. In such a set, at least 
pn — p 4 An of the vertices have no incident edges, and thus form an independent set. We would like 
this independent set to be large. Clearly, we need to make p small enough to ensure p 4 An <C pn, 
meaning p <C A -1 / 3 . Taking p much smaller only decreases the size of the independent set, so it 
turns out that our best choice is to take p ~ A _1 / 3 /2, yielding an indpendent set of size 
Repeating this capture process many times therefore achieves an (^(A 1 / 3 ) coloring. 

We now formalize the intuitive argument. The vector r will be a random n-dimensional vector. 
We precisely compute the expectation of n', the number of vertices captured, and the expectation 
of m', the number of edges in the induced graph of the captured vertices. We first show that when 
r is a random normal vector and our projection threshold is c, the expectation of n' — ml exceeds 
n(N(c) — AiV(ac)) for a certain constant a depending on the vector chromatic number. We also 

2 

show that N(ac) grows roughly as N(c) a . (For the case of 3-coloring we have a = 2, and thus if 
N(c) = p, then N{ac) ~ p 4 .) By picking a sufficiently large c, we can find an independent set of size 
0,(N(c)). (In the following lemma, n' and ml are functions of c: we do not make this dependence 
explicit.) 

Lemma 7.5 Let a = y ^^p • Then for any c, 

E[n> -m']>n (n(c) - . 

Proof: We first bound E [n'\ from below. Consider a particular vertex i with assigned vector 
Uj. The probability that it is in the selected set is just P • r > c\. By Corollary |7.3| , V{ ■ r is 
normally distributed and thus this probability is N(c). By linearity of expectations, the expected 
number of selected vertices E [n'] = nN{c). 

Now we bound E [ml] from above. Consider an edge with endpoint vectors v\ and t>2- The 
probability that this edge is in the induced subgraph is the probability that both endpoints are 
selected, which is 

P [v\ ■ r > c and v% • r > c] < P [(vi + V2) • r > 2c] 



9 



N 



V\ + V 2 

\vi + V2W 
2c 

\\vi + v 2 \ 



r > 



2c 



where the expression follows from Corollary 7.3 applied to the preceding probability expression. 
We now observe that 



N + v 2 \\ = \Jv\ + v1 + 2vi ■ v 2 



< 



2/(fc-l) 

= yj2(k-2)/(k-l) 

= 2/0. 

It follows that the probability that both endpoints of an edge are selected is at most N(ac). If the 
graph has maximum degree A, then the total number of edges is at most nA/2. Thus the expected 
number of selected edges, E [m 1 ], is at most nAN(ac)/2. 
Combining the previous arguments, we deduce that 

E [ri - rri] > nN(c) - nAN(ac)/2. 



We now determine a c such that AN(ac) < N(c). This will give us an expectation of at least 



N(c)/2 in the above lemma. Using the bounds on N(x) in Lemma 7.4, we find that 



N(c) 
N(ac) 



> 



e- a2c2 /2/ ac 



a 1-4 e (« 2 -D C 2 /2 



> V2\l 



The last equation holds since a = y/2(k-l)/(k - 2) > V2. Thus if we choose c so that 1 — 1/c 2 > 
and e( a I 2 > A, then we get AA^(ac) < N(c). Both conditions are satisfied, for sufficiently 
large A, if we set 

(For smaller values of A we can use the greedy A + 1-coloring algorithm to get a color the graph 
with a bounded number of colors, where the bound is independent of n.) 

For this choice of c, we find that the independent set that is found has size at least 



E[n'-m'] > nN(c)/2 



1 1 



> n 



n 



A^VhiA 



as desired. This concludes the proof of Theorem 7.1 
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7.3 Adding Wigderson's Technique 

To conclude, we now determine absolute approximation ratios independent of A. This involves 
another application of Wigderson's technique. If the graph has any vertex of large degree, then 
we use the fact that its neighborhood is large and is vector {k — l)-chromatic, to find a large 
independent set in its neighborhood. If no such vertex exists, then the graph has small maximum 



degree, so we can use Theorem 7.1 to find a large independent set in the graph. After extracting 
such an independent set, we recurse on the rest of the graph. The following lemma describes the 
details, and the correct choice of the threshold degree. 

Lemma 7.6 For every integer function k = k(n), any vector k-colorable graph on n vertices can 
be semicolored with 0{n l ~^/( k+1 ^ log 1 / 2 n) colors by a probabilistic polynomial time algorithm. 

Proof: Given a vector fc-colorable graph G, we show how to find an independent set of size 
f2(n 3 /( fc+1 )/ log 1 / 2 n) in the graph. Assume, by induction on k, that there exists a constant c > 
s.t. we can find an independent set of size ci 3 ^ k ' / (log 1 ' 2 i) in any fc'-vector chromatic graph on 
i nodes, for k' < k. We now prove the inductive assertion for k. 

Let Afc = Afe(n) = n k ^ k+l \ If G has a vertex of degree greater than Afc(n), then we find 
a large independent set in the neighborhood of G. By Lemma [4.3| , the neighborhood is vector 
(k — l)-colorable. Hence we can find in this neighborhood, an independent set of size at least 
c(A fc ) 3 / fc /(log 1/2 A fc ) > cn 3 /( fc+1 )/(log 1 / 2 n). If G does not have a vertex of de gree greater than 
Afc(n), then by Theorem 7.1, we can find an independent set of size at least cn/(A fc ) 1 ~ 2 / fc /log 1/2 A k > 



cn 3 /( fc+1 )/ log 1 / 2 n in G. This completes the induction. 

By now assigning a new color to each such independent set, we find that we can color at least 
n/2 vertices, using up at most 0(n 1-3//( - fc+1 ) log 1 / 2 n) colors. I 



The semicolorings guaranteed by Theorem 71 and 76 can be converted into colorings using 
Lemma |5.1[ , yielding the following theorem. 

Theorem 7.7 Any vector k-colorable graph on n nodes with maximum degree A can be colored, in 
probabilistic polynomial time, using min{0(A 1 " 2 / fe Vln A log n), 0(n 1-3 /( fe+1 )\/hin)} colors. 

8 Duality Theory 

The most intensively studied relaxation of a semidefinite programming formulation to date is the 
Lovasz -^-function p3| , p4| , 33]. This relaxation of the clique number of a graph led to the first 



polynomial-time algorithm for finding the clique and chromatic numbers of perfect graphs. We 
now investigate a connection between $ and a close variant of the vector chromatic number. 

Intuitively, the clique and coloring problems have a certain "duality" since large cliques prevent 
a graph from being colored with few colors. Indeed, it is the equality of the clique and chromatic 
numbers in perfect graphs that lets us compute both in polynomial time. We proceed to formalize 
this intuition. The duality theory of linear programming has an extension to semidefinite pro- 
gramming. With the help of Eva Tardos and David Williamson, we have shown that in fact the 
i?-function and a close variant of the vector chromatic number are semidefinite programming duals 
to one another and are therefore equal. 

We first define the variant. 

Definition 8.1 Given a graph G = (V,E) on n vertices, a strict vector fc-coloring of G is an 
assignment of unit vectors U{ from the space 3ft" to each vertex i £ V , such that for any two 



11 



adjacent vertices i and j the dot product of their vectors satisfies the equality 

1 



(Ui,Uj) 



k-1 



As usual we say that a graph is strictly vector fc-colorable if it has a strict vector ^-coloring. 
The strict vector chromatic number of a graph is the smallest real number k for which it has a 
strict vector /c-coloring. It follows from the definition that the strict vector chromatic number of 
any graph is lower bounded by the vector chromatic number. 

Theorem 8.1 The strict vector chromatic number of G is equal to $(G). 

Proof: The dual of our strict vector coloring semidefinite program is as follows (cf. 0): 

maximize — 2_. pa 

where {Pij} is positive semidefinite 
subject to ^^Pij > 1 

Pij = Pji 

Pij = for (i,j) $ E and i ^ j 

By duality, the value of this SDP is —\/{k — 1) where k is the strict vector chromatic number. Our 
goal is to prove k = ■d. As before, the fact that {pij} is positive semidefinite means we can find 
vectors such that p« = (vj , Vj) . The last constraint says that the vectors v form an orthogonal 
labeling i.e. that (ui,Vj) = for $ E. We now claim that the above optimization problem 
can be reformulated as follows: 

maximize — - 

over all orthogonal labelings {vi}. To see this, consider an orthogonal labeling and define /x = 
Yli^j fyi ) v j) ■ Note this is the value of the first constraint in the first formulation of the dual (that 
is, the constraint is \x < 1) and of the denominator in the second formulation. Then in an optimum 
solution to the first formulation, we must have fj, = 1, since otherwise we can divide each Vi by 
and get a feasible solution with a larger objective value. Thus the optimum of the second 
formulation is at least as large as that of the first. Similarly, given any optimum {vi} for the second 
formulation, V{j ' JJl forms a feasible solution to the first formulation with the same value. Thus the 
optima are equal. We now manipulate the second formulation. 

max — ; = max 



Ei^j (Vi , Vj) " ' & , Vj) ~T,(Vi, Vi) 

Ei,j(vi,Vj) -T,(vi,vl 



-1 



mm ■ 



J2i,j(vi,Vj) 

mm 



max— ^ - 
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It follows from the last equation that the strict vector chromatic number is 



max ■ 



T,( v i, v i) 

However, by the same argument as was used to reformulate the dual, this is equal to problem of 
maximizing J2i,j(pii v j) over an orthogonal labelings such that J2fyii v i) — 1- This is simply Lovasz's 
$3 formulation of the i9-function p3, page 287]. I 



9 The Gap between Vector Colorings and Chromatic Numbers 

The performance of our randomized rounding approach seems far from optimum. In this section 
we ask why, and show that the problem is not in the randomized rounding but in the gap between 
the original problem and its relaxation. We investigate the following question: given a vector k- 
colorable graph G, how large can its chromatic number be in terms of k and n? We will show that 
a graph with chromatic number nP^ 1 ' can have bounded vector chromatic number. This implies 
that our technique is tight in that it is not possible to guarantee a coloring with colors on all 
vector 3-colorable graphs. 

Definition 9.1 The Kneser graph K(m, r, t) is defined as follows: the vertices are all possible r-sets 
from a universe of size m; and, the vertices Vi and Vj are adjacent if and only if the corresponding 
r-sets satisfy \Si n Sj\ < t. 



We will need following theorem of Milner [36] regarding intersecting hypergraphs. Recall that 



a collection of sets is called an antichain if no set in the collection contains another. 

Theorem 9.1 (Milner) Let Si, . . ., S a be an antichain of sets from a universe of size m such 
that, for all i and j, 



Then, it must be the case that 



\Si n Sj\ > t. 



/ m 
a < I m+m 



Notice that using all (/-sets, for q = (m + 1 + l)/2, gives a tight example for this theorem. 

The following theorem establishes that the Kneser graphs have a large gap between their vector 
chromatic number and chromatic numbers. 

Theorem 9.2 Let n = (™) denote the number of vertices of the graph K(m,r,t). For r = m/2 
and t = m/8, the graph K(m,r,t) is vector 3-colorable but has chromatic number at least n - 0113 . 

Proof: We prove a lower bound on the Kneser graph's chromatic number x by establishing an 
upper bound on its independence number a. It is easy to verify that the a in Milner's theorem is 
exactly the independence number of the Kneser graph. To bound x observe that 

n 

X > - 
a 

> o 

( m ) 

\{m+t)/2) 
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I m \ 
_ \m/2> 
- I m \ 
V9m/16/ 

2 m (l -o(l)) 

~ 2 (l-o(l))m((9/16) lg(16/9)+(7/16) lg(16/7)) 
> 2' 0113m for large enough m. 

In the above sequence, the fourth line uses the approximation 

2m (-^ig^-(i-^)i g (i- W/ ^_ j 




\(3m. 

for every (3 G (0, 1), where is a constant depending only on [3. Using the inequality 

«=(:)< 2™, 

we obtain m > lg n and thus 

X > (1.007864) 1 *" = n^ 1 - 007864 « n a0113 . 

Finally, it remains to show that the vector chromatic number of this graph is 3. This follows by 
associating with each vertex v\ an m-dimensional vector obtained from the characteristic vector of 
the set Si. In the characteristic vector, +1 represents an element present in Si and —1 represents 
elements absent from Si. The vector associated with a vertex is the characteristic vector of Si 
scaled down by a factor of \fm to obtain a unit vector. Given vectors corresponding to sets Si and 
Sj, the dot product gets a contribution of — 1/m for coordinates in SiASj and +l/m for the others. 
(Here AAB represents the symmetric difference of the two sets, i.e., the set of elements that occur 
in exactly one of A or B.) Thus the dot product of two adjacent vertices, or sets with intersection 
at most t, is given by 

1 _ 2\SjASj\ = 1 _ 2(|5 i | + |g J -|-2|5 i n5 j |) < 1 _ 4r - At = 
m m ~ m 

This implies that the vector chromatic number is 3. I 
More refined calculations can be used to improve this bound somewhat. 

Theorem 9.3 There exists a Kneser graph K(m, r, t) that is 3-vector colorable but has chromatic 
number exceeding n °- 016101 ) where n = ("*) denotes the number of vertices in the graph. Further, for 
large k, there exists a Kneser graph K(m, r, t) that is k-vector colorable but has chromatic number 
exceeding n - 0717845 . 

Proof: The basic idea is to improve the bound on the vector chromatic number of the Kneser 
graph using an appropriately weighted version of the characteristic vectors. We use weights a and 
— 1 to represent presence and absence, respectively, of an element in the set corresponding to a 
vertex in the Kneser graph, with appropriate scaling to obtain a unit vector. The value of a that 
minimizes the vector chromatic number can be found by differentiation and is 

mr mt 

A = -l + 



r 2 — rt r 2 — rt 
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Setting a = A proves that the vector chromatic number is at most 

m(r — t) 
r 2 — mt 

At the same time, using Milner's Theorem proves that the exponent of the chromatic number is at 
least 



1 



(m-t)log^ + (m + t)log^ 
2((m-r)log^ ? +rlogf 



By plotting these functions, we have shown that there is a set of values with vector chromatic 
number 3 and chromatic number at least n - 016101 . p OI large constant vector chromatic numbers, 
the limiting value of the exponent of the chromatic number is roughly 0.0717845. I 



10 Conclusions 

The Lovasz number of a graph has been a subject of active study due to the close connections be- 
tween this parameter and the clique and chromatic numbers. In particular, the following "sandwich 
theorem" was proved by Lovasz |}3] (see Knuth |32| for a survey). 



v(G) < < x{G). (1) 

This led to the hope that the following question may have an affirmative answer. Do there exist e, 
e' > such that, for any graph G on n vertices 

^ < u(G) < &(G) < X (G) < fl(G) x n 1 -'? (2) 

Our work in this paper proves a weak but non-trivial upper bound on the the chromatic number of 
G in terms of $(G). However, this is far from achieving the bound conjectured above and subsequent 
to our work, two results have ended up answering this question negatively. Feige |l(| has shown 
that for every e > 0, there exist families of graphs for which x(G) > $(G)n e . Interestingly, 
the families of graphs exhibited in Feige's work use the construction of Section ^ as a starting 



point. Even more conclusively, the results of Hastad 26] and Feige and Kilian Jig] have shown that 
no polynomial time computable function approximates the clique number or chromatic number to 
within factors of n 1_<E , unless NP=RP. Thus no simple modification of the i? function is likely to 
provide a much better approximation guarantee. 

In related results, Alon and Kahale || have also been able to use the semidefinite programming 
technique in conjunction with our techniques to obtain algorithms for computing bounds on the 
clique number of a graph with linear-sized cliques, improving upon some results due to Boppana and 
Halldorsson |l(| . Independent of our results, Szegedy ^l| has also shown that a similar construction 
yields graphs with vector chromatic number at most 3 that are not colorable using n 05 colors. 
Notice that the exponent obtained from his result is better than the one in Section ^. Alon Q has 
obtained a slight improvement over Szegedy's bound by using an interesting variant of the Kneser 
graph construction. The main algorithm presented here has been derandomized in a recent work 
of Mahajan and Ramesh p5| ]. By combining our techniques with those of Blum Q, Blum and 
Karger |BK97 ] have given a 3-coloring algorithm with approximation ratio (5(n 3 / 14 ). 
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